SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: SPEAR, Patricia G. 

x MONTGOMERY, Rebecca I . 

(ii) TITLE OF INVENTION: HERPES VIRUS ENTRY RECEPTOR PROTEIN 
(Hi) NUMBER OF SEQUENCES: 7 % 

(iV) °aS l ffiBS= A SSBSR > GOLDSMITH, SHORE I MILNAMOW 
?B) STREET: 180 N. STETSON, SUITE 4700 

(C) CITY: CHICAGO 

(D) STATE: ILLINOIS 

(E) COUNTRY: U.S.A 

(F) ZIP: 60601 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 
B) COMPUTER: IBM PC co— *'■ 




(B1 COMPUTER: IBM conpataBxe 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS . . 

(D) SOFTWARE: Patentln Release #1.0, Versxon #1.30 



(vi) CURRENT APPLICATION DATA: 
11 (A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY /AGENT INFORMATION: 
{ ' (M NAME: NORTHRUP, THOMAS E, 

(B) REGISTRATION NUMBER: 33,268 

(C) REFERENCE /DOCKET NUMBER : XX 

(ix) TELECOMMUNICATION INFORMATION: 
(A) TELEPHONE: (312) 616-5400 
B) TELEFAX: (312) 616-5460 



TELEX: XX 



(2) INFORMATION FOR SEQ ID N0:1: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1719 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: Bat_peptide 

(B) LOCATION : 293 . . 1 1 89 

(ix) FEATURE: 

V ' (A) NAME/KEY: CDS 

(B) LOCATION: 293.. 11 92 

(ix) FEATURE: j • 

(A) NAME /KEY: sig_peptid 

(B) LOCATION: 293.. 406 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1 
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CCTTCATACC GGCCCTTCCC CTCGGCTTTG CCTGGACAGC TCTGCCTCCC GCAGGGCCCA 60 

CCTGTGTCCC CCAGCGCCGC TCCACCCAGC AGGCCT6AGC CCCTCTCTGC TGCCAGACAC 120 

CCCCTGCTGC CCACTCTCCT GCTGCTCGGG TTCTGAGGCA CAGCTTGTCA CACCGAGGCG 180 

GATTCTCTTT CTCTTTCTCT TCTGGCCCAC AGCCGCAGCA ATGGCGCTGA GTTCCTCTGC 240 

TGGAGTTCAT GCTGCTAGCT GGGTTCCCGA GCTGCCGGTC TGAGCCTGAG GC ATG 295 



1 



GAG CCT CCT GGA GAC TGG GGG CCT CCT CCC TGG AGA TCC ACC CCC AGA 343 
GlS p£ Pro GlJ Asp Trp Gly Pro Pro Pro Trp Arg Ser Thr Pro Arg 



| s E 



5 S 



ACC TAC ATT GCC CAC CTC AAT GGC CTA AGC AAG TGT CTG CAG TGC CAA 
Th? ™r lie Ala His Leu Asn Gly Leu Ser Lys Cys Leu Gin Cys Gin 

^ 35 90 y£> 

3 ATG TGT GAC CCA GCC ATG GGC CTG CGC GCG ACG CGG AAC TGC TCC AGG 
3 Met Cys Asp Pro Ala Met Gly Leu Arg Ala Thr Arg Asn Cys Ser Arg 



100 



391 



ACC GAC GTC TTG AGG CTG GTG CTG TAT CTC ACC TTC CTG GGA GCC CCC 
Thr Asp Val Leu Arg Leu Val Leu Tyr. Leu Thr Phe Leu Gly Ala Pro 

20 25 30 

TGC TAC GCC CCA GOT CTG CCG TCC TGC AAG GAG GAC GAG TAC CCA GTG 43? 
Cys Tyr Ala Pro Ala Leu Pro Ser Cys Lys Glu Asp Glu Tyr Pro Val 
35 40 45 

GGC TCC GAG TGC TGC CCC ACG TGC AGT CCA GGT TAT CGT GTG AAG GAG 487 
Gly" Ser Glu Cys Cys Pro Thr Cys Ser Pro Gly Tyr Arg Val Lys Glu 
50 55 60 

GCC TGC GGG GAG CTG ACG GGC ACA GTG TGT GAA CCC TGC CCT CCA GGC 
« Ala Cys Gly Glu Leu Thr Gly Thr Val Cys Glu Pro Cys Pro Pro Gly 

** 70 75 ou 



535 



583 



631 



679 



727 



823 



ACA GAG AAC GCC GTG TGT GGC TGC AGC CCA GGC CAC TTC TGC ATC GTC 
Thr Glu Asn Ala Val Cys Gly Cys Ser Pro Gly His Phe Cys He Val 
115 120 125 

CAG GAC GGG GAC CAC TGC GCC GGT GCC GCC GTT ACG CCA CCT CCA GCC 
Gin Asp Gly /Sp His Cys Ala Gly Ala Ala Val Thr Pro Pro Pro Ala 

CGG GCC AGA GGG TGC AGA AGG GAG GCA CCG AGA GTC AGG ACA CCC TGT 
Arg Ala Arg Gly Cys Arg Arg Glu Ala Pro Arg Val Arg Thr Pro Cys 

150 155 

GTC AGA ACT GCC CCC GGG GAC CTT CTC TCC AAT GGG ACC CTG GAG GAA 
vll Arg Thr Ala Pro Gly Asp Leu Leu Ser Asn Gly Thr Leu Glu Glu 

165 170 1 

TGT CAG CAC CAG ACC AAG TGC AGC TGG CTG GTG ACG AAG GCC GGA GCT 
Cys Gin His Gin Thr Lys Cys Ser Trp Leu Val Thr Lys Ala Gly AJ.a 
' 180 185 19° 

GGG ACC AGC AGC TCC CAC TGG GTA TGG TGG TTT CTC TCA GGG AGC CTC 
Gly Thr Ser Ser Ser His Trp Val Trp Trp Phe Leu Ser Gly Ser Leu 
195 200 205 

GTC ATC GTC ATT GTT TGC TCC ACA GTT GGC CTA ATC ATA TGT GTG AAA 
Val He Val He Val Cys Ser Thr Val Gly Leu He He Cys vax Lys 
210 215 220 

AGA AGA AAG CCA AGG GGT GAT GTA GTC AAG GTG ATC GTC TCC GTC CAG 1015 

36 
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Arg Arg Lys Pro Arg Gly Asp Val Val Ly» Val lie Val Ser Val Gin 

230 235 * w 



rcr aaa AftA GAG GAG GCA GAA GGT GAG GCC ACA GTC ATT GAG GCC CTG 
t£ Art GlS Glu Ala* Glu Gly Glu Ala Thr Val He Glu Ala Leu 
245 

r . r rrr rrr CCG 6AC GTC ACC ACG GTG GCC GTG.A6G AGA CAA TAC CCT 
GlS Ala" p£ a5r Val Thr Thr Val Ala Val Arg Arg Gin Tyr Pro 
260 265 

CAT TCA CGG GGA GGA GCC CAA ACC ACT GAG CCA CAG ACT CTG CAC CCC 
Sil Se^ Xrg GlJ Gly Ala Gin Thr Thr Asp Pro Gin. Thr Leu His Pro 

275 280 
GAC GCC AGA GAT ACC TGG AGC GAC GGC TGC TGA AAGAGGCTGT CCACCTGGCG 
Asp Ala Arg Asp Thr Trp Ser Asp Gly Cys * 

290 295 ogu 

AAACCACCGG AGCCCGGAGG CTTGGGGGCT CCGCCCTGGG CTGGCTTCCG TCTCCTCCAG 
TGGAGGGAGA GGTGGGGCCC CTGCTGGGGT AGAGCTGGGG ACGCCACGTG CCATTCCCAT 
GGGCCAGTGA GGGCCTGGGG CCTCTGTTCT GCTGTGGCCT GAGCTCCCCA GAGTCCTGAG 
GAGGAGCGCC AGTTGCCCCT CGCTCACAGA CCACACACCC AGCCCTCCTG GGCCAGCCCA 
GAGGGCCCTT CA6ACCCCAG CTGTCTGCGC GTCTGACTCT TGTGGCCTCA 6CAGGACAGG 
CCCCGGGCAC TGCCTCACAG CCAAGGCTGG ACTGGGTTGG CTGCAGTGTG GTGTTTAGTG 
GATACCACAT CGGAAGTGAT TTTCTAAATT GGATTTGAAT TCCGGTCCTG TCTTCTATTT 
GTCATGAAAC AGTGTATTTG GGGAGATGCT GTGGGAGGAT GTAAATATCT TGTTTCTCCT 
CAAAAAAAAA AAAAAAAAAA AAAAAAA 

(2) INFORMATION FOR SEO. ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

A) LENGTH: 299 amino acids 

B) TYPE: anino acid 
0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEO ID N0:2: 

Met Glu Pro Pro Gly Asp Trp Gly Pro Pro Pro Trp Arg Ser Thr Pro 

1 5 10 

Arg Thr Asp Val Leu Arg Leu Val Leu Tyr Leu Thr Phe Leu Gly Ala 

20 25 

Pro Cys Tyr Ala Pro Ala Leu Pro Ser Cys Lys Glu Asg Glu Tyr Pro 

35 

Val Gly Ser Glu Cys Cys Pro Thr Cys Ser Pro Gly Tyr Arg Val Lys 

50 55 

Glu Ala Cys Gly Glu Leu Thr Gly Thr Val Cys Glu Pro Cys Pro Pro 

65 70 '5 

Gly Thr Tyr He Ala His Leu Asn Gly Leu Ser Lys Cys Leu Gin Cys 

85 **** 

Gin Met Cys Asp Pro Ala Met Gly Leu Arg Ala Thr Arg Asn Cy Ser 

100 105 



1063 



1111 



1159 



1212 



1272 

1332 

1392 

1452 

1512 

1572 

1632 

1692 

1719 



Arg Thr Glu Asn Ala Val Cys Gly Cys Ser Pro Gly His Phe Cys II 
115 120 125 

Val Gin A p Gly Asp His Cys Ala Gly Ala Ala Val Thr Pro Pro Pro 
130 135 140 

Ala Arg Ala Arg Gly Cys Arg Arg Glu Ala Pro Arg Val Arg Thr Pro 
145 150 155 160 

Cys Val Arg Thr Ala Pro Gly Asp Leu Leu Ser Asn Gly Thr Leu Glu 

165 170 175 

Glu Cys Gin His Gin Thr Lys Cys Ser Trp Leu Val Thr Lys Ala Gly 
; 180 185 190 

Ala Gly Thr Ser Ser Ser His Trp Val Trp Trp Phe Leu Ser Gly Ser 
195 200 205 

Leu Val lie Val lie Val Cys Ser Thr Val Gly Leu lie lie Cys Val 
210 215 220 

Lys Arg Arg Lys Pro Arg Gly Asp Val Val Lys Val lie Val Ser Val 
225 230 235 240 

Gin Arg Lys Arg Gin Glu Ala Glu Gly Glu Ala Thr Val lie Glu Ala 

245 250 



Leu Gin Ala Pro Pro Asp Val Thr Thr Val Ala Val Arg Arg Gin Tyr 

260 265 270 

Pro His Ser Arg Gly Gly Ala Gin Thr Thr Asp Pro Gin Thr Leu His 
275 280 285 

Pro Asp Ala Arg Asp Thr Trp Ser Asp Gly Cys 
290 295 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
AACCCGGCTC GAGCGGCCGC T 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 
GAATTCCACC ACACTTAAGG TG 



(2) INFORMATION FOR SEQ ID N0:5: 
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(i) SEQUENCE CHARACTERISTICS : 
'A) LENGTH: 19 base pairs 

B) TYPE: nucleic acid 

C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5 
ACAAGACCGT TGCACCCTC 

(2) INFORMATION FOR SEQ ID N0:6: 

(i) SEQUENCE CHARACTERISTICS: 

A) LENGTH: 4619 base pairs 

B) TYPE: nucleic acid 



C) STRANDEDNESS: .single 
0) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 64.. 1320 

(ix) FEATURE: 

(A) NAME/KEY: natjeptide 

(B) LOCATION: 64.. 1317 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
AAGCTTGCAT GCCTGCAGGT CGACTCTAGC TGGGTTCCCG AGCTGCCGGT CTGAGCCTGA 

GGC ATG GAG CCT CCT GGA GAC TGG GGG CCT CCT CCC TGG AGA TCC ACC 
Met Glu Pro Pro Gly Asp Trp Gly Pro Pro Pro Trp Arg Ser Thr 
1 5 10 15 

CCC AGA ACC GAC GTC TTG AGG CTG GTG CTG TAT CTC ACC TTC CTG GGA 
Pro Arg Thr Asp Val Leu Arg Leu Val Leu Tyr Leu Thr Phe Leu Gly 

20 .25 30 

GCC CCC TGC TAC GCC CCA GCT CTG CCG TCC TGC AAG GAG GAC GAG TAC 
Ala Pro Cvs Tyr Ala Pro Ala Leu Pro Ser Cys Lys Glu Asp Glu Tyr 

§5 40 45 

CCA GTG GGC TCC GAG TGC TGC CCC ACG TGC AGT CCA GGT TAT CGT GTG 

Pro Val Gly Ser Glu Cys Cys Pro Thr Cys Ser Pro Gly Tyr Arg Val 

50 55 60 

AAG GAG GCC TGC GGG GAG CTG ACG GGC ACA GTG TGT GAA CCC TGC CCT 
Lys Glu Ala Cys Gly Glu Leu Thr Gly Thr Val Cys Glu Pro Cys Pro 
65 70 75 

CCA GGC ACC TAC ATT GCC CAC CTC AAT GGC CTA AGC AAG TGT CTG CAG 

Pro Gly Thr Tyr He Ala His Leu Asn Gly Leu Ser Lys Cys Leu Gin 

80 85 90 95 

TGC CAA ATG TGT GAC CCA GCC ATG GGC CTG CGC GCG ACG CGG AAC TGC 
Cvs Gin Met Cys Asp Pro Ala Met Gly Leu Arg Ala Thr Arg Asn Cys 

100 105. 110 

TCC AGG ACA GAG AAC GCC GTG TGT GGC TGC AGC CCA GGC CAC TTC TGC 
Ser Arg Thr Glu Asn Ala Val Cys Gly Cys Ser Pro Gly His Phe Cys 

115 120 125 

ATC GTC CAG GAC GGG GAC CAC TGC GCC GGT GCC GCC GTT ACG CCA CCT 
He Val Gin Asp Gly Asp His Cys^Ala Gly Ala Ala Val Thr Pro Pro 



130 



135 



140 
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CCA GCC CGG GCC AGA GGG TGC AGA AGG GAG GCA CCG AGA GTC AGG ACA 
Pro Ala Arg Ala Arg Gly Cys Arg Arg Glu Ala Pr Arg Val Arg Thr 
145 150 155 

CCC TGT GTC AGA ACT GCC CCC GGG GAC CTT CTC TCC AAT GGG ACC CTG 
Pro Cvs Val Arg Thr Ala Pro Gly Asp Leu Leu Ser Asn Gly Thr Leu 
160 165 170 175 

GAG GAA TGT CAG CAC CAG ACC AAG TGC AGA ATT CAC AAG ACC GTT GCA 
Glu Glu Cys Gin His Gin Thr Lys Cys Arg lie His Lys Thr Val Ala 

180 185 190 

CCC TCG ACA TGC AGC AAG CCC ACG TGC CCA CCC CCT GAA CTC CTG GGG 
Pro Ser Thr Cys Ser Lys Pro Thr Cys Pro Pro Pro Glu Leu Leu Gly 

195 200 205 

GGA CCG TCT GTC TTC ATC TTC CCC CCA AAA CCC AAG GAC ACC CTC ATG 
Glv Pro Ser Val Phe lie Phe Pro Pro Lys Pro Lys Asp Thr Leu Met 
3 210 215 220 

ATC TCA CGC ACC CCC GAG GTC ACA TGC GTG GTG GTG GAC GTG AGC CAG 
lie Ser Ara Thr Pro Glu Val Thr Cys Val Val Val Asp Val Ser Gin 
225 230 235 

GAT GAC CCC GAG GTG CAG TTC ACA TGG TAC ATA AAC AAC GAG CAG GTG 
Asp Asp Pro Glu Val Gin Phe Thr Trp Tyr lie Asn Asn Glu Gin Val 
240 245 250 255 

CGC ACC GCC CGG CCG CCG CTA CGG GAG CAG CAG TTC AAC AGC ACG ATC 
Ara Thr Ala Ara Pro Pro Leu Arg Glu Gin Gin Phe Asn Ser Thr lie 

260 265 270 

CGC GTG GTC AGC ACC CTC CCC ATC ACG CAC CAG GAC TGG CTG AGG GGC 
Ara Val Val Ser Thr Leu Pro lie Thr His Gin Asp Trp Leu Arg Gly 

275 280 285 

AAG GAG TTC AAG TGC AAA GTC CAC AAC AAG GCA CTC CCG GCC CCC ATC 
Lvs Glu Phe Lys Cys Lys Val His Asn Lys Ala Leu Pro Ala Pro lie 
290 295 300 

GAG AAA ACC ATC TCC AAA GCC AGA GGG CAG CCC CTG GAG CCG AAG GTC 
Glu Lys Thr lie Ser Lys Ala Arg Gly Gin Pro Leu Glu Pro Lys Val 
305 310 315 

TAC ACC ATG GGC CCT CCC CGG GAG GAG CTG AGC AGC AGG TCG GTC AGC 
Tyr Thr Met Gly Pro Pro Arg Glu Glu Leu Ser Ser Arg Ser Val Ser 
320 325 330 335 

CTG ACC TGC ATG ATC AAC GGC TTC TAC CCT TCC GAC ATC TCG GTG GAG 
Leu Thr Cys Met lie Asn Gly Phe Tyr Pro Ser Asp lie Ser Val Glu 

340 345 350 

TGG GAG AAG AAC GGG AAG GCA GAG GAC AAC TAC AAG ACC ACG CCG GCC 
Tro Glu Lvs Asn Gly Lys Ala Glu Asp Asn Tyr Lys Thr Thr Pro Ala 

355 360 365 

GTG CTG GAC AGC GAC GGC TCC TAC TTC CTC TAC AAC AAG CTC TCA GTG 
Val Leu Asp Ser Asp Gly Ser Tyr Phe Leu Tyr Asn Lys Leu Sep Val 
370 375 380 

CCC ACG AGT GAG TGG CAG CGG GGC GAC GTC TTC ACC TGC TCC GTG ATG 
Pro Thr Ser Glu Trp Gin Arg Gly Asp Val Phe Thr Cys Ser Val Met 
385 390 395 

CAC GAG GCC TTG CAC AAC CAC TAC ACG CAG AAG TCC ATC TCC CGC TCT 
His Glu Ala Leu His Asn His Tyr Thr Gin Lys Ser lie Ser Arg Ser 
400 405 410 415 
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636 



684 



732 



780 



828 



876 



924 



972 



1020 



1068 



1116 



1164 



4212 



1260 



1308 
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CCG GGT AAA TGA GCGCTGTGCC GGCGAGCTGC CCCTCTCCCT CCCCCCCACG 

Pro Gly Lys * 

CCGCAGCTGT GCACCCCGCA CACAAATAAA GCACCCAGCT CTGCCCTGAA CAGCTTCCGG 
TCTCCCTATA GTGAGTCGTA TTAATTTCGA TAAGCCAGCT GCATTAATGA ATCGGCCAAC 
GCGCGGGGAG AGGCGGTTTG CGTATTGGGC GCTCTTCCGC TTCCTCGCTC ACTGACTCGC 
TGCGCTCGGT CGTTCGGCTG CGGCGAGCGG TATCAGCTCA CTCAAAGGCG GTAATACGGT 
TATCCACAGA ATCAGGGGAT AACGCAGGAA AGAACATGTG AGCAAAAGGC CAGCAAAAGG 
CCAGGAACCG TAAAAAGGCC GCGTTGCTGG CGI I 1 1 1 CCA TAGGCTCCGC CCCCCTGACG 
AGCATCACAA AAATCGACGC TCAAGTCAGA GGTGGCGAAA CCCGACAGGA CTATAAAGAT 
ACCAGGCGTT TCCCCCTGGA AGCTCCCTCG TGCGCTCTCC TGTTCCGACC CTGCCGCTTA 
CCGGATACCT GTCCGCCTTT CTCCCTTCGG GAAGCGTGGC GCTTTCTCAT AGCTCACGCT 
GTAGGTATCT CAGTTCGGTG TAGGTCGTTC GCTCCAAGCT GGGCTGTGTG CACGAACCCC 
CCGTTCAGCC CGACCGCTGC GCCTTATCCG GTAACTATCG TCTTGAGTCC AACCCGGTAA 
GACACGACTT ATCGCCACTG GCAGCAGCCA CTGGTAACAG GATTAGCAGA GCGAGGTATG 
TAGGCGGTGC TACAGAGTTC TTGAAGTGGT GGCCTAACTA CGGCTACACT AGAAGGACAG 
TATTTGGTAT CTGCGCTCTG CTGAAGCCAG TTACCTTCGG AAAAAGAGTT GGTAGCTCTT 
GATCCGGCAA ACAAACCACC GCTGGTAGCG GTGG 1 1 1 I I I TGTTTGCAAG CAGCAGATTA 
CGCGCAGAAA AAAAGGATCT CAAGAAGATC CTTTGATCTT TTCTACGGGG TCTGACGCTC 
AGTGGAACGA AAACTCACGT TAAGGGATTT TGGTCATGAG ATTATCAAAA AGGATCTTCA 
CCTAGATCCT TTTAAATTAA AAATGAAGTT TTAAATCAAT CTAAAGTATA TATGAGTAAA 
CTTGGTCTGA CAGTTACCAA TGCTTAATCA GTGAGGCACC TATCTCAGCG ATCTGTCTAT 
TTCGTTCATC CATAGTTGCC TGACTCCCCG TCGTGTAGAT AACTACGATA CGGGAGGGCT 
TACCATCTGG CCCCAGTGCT GCAATGATAC CGCGAGACCC ACGCTCACCG GCTCCAGATT 
TATCAGCAAT AAACCAGCCA GCCGGAAGGG CCGAGCGCAG AAGTGGTCCT GCAACTTTAT 
CCGCCTCCAT CCAGTCTATT AATTGTTGCC GGGAAGCTAG AGTAAGTAGT TCGCCAGTTA 
ATAGTTTGCG CAACGTTGTT GCCATTGCTA CAGGCATCGT GGTGTCACGC TCGTCGTTTG 
GTATGGCTTC ATTCAGCTCC GGTTCCCAAC GATCAAGGCG AGTTACATGA TCCCCCATGT 
TGTGCAAAAA AGCGGTTAGC TCCTTCGGTC CTCCGATCGT TGTCAGAAGT AAGTTGGCCG 
CAGTGTTATC ACTCATGGTT ATGGCAGCAC TGCATAATTC TCTTACTGTC ATGCCATCCG 
TAAGATGCTT TTCTGTGACT GGTGAGTACT CAACCAAGTC ATTCTGAGAA TAGTGTATGC 
GGCGACCGAG TTGCTCTTGC CCGGCGTCAA TACGGGATAA TACCGCGCCA CATAGCAGAA 
CTTTAAAAGT GCTCATCATT GGAAAACGTT CTTCGGGGCG AAAACTCTCA AGGATCTTAC 
CGCTGTTGAG ATCCAGTTCG ATGTAACCCA CTCGTGCACC CAACTGATCT TCAGCATCTT 
TTACTTTCAC CAGCGTTTCT GGGTGAGCAA AAACAGGAAG GCAAAATGCC GCAAAAAAGG 
GAATAAGGGC GACACGGAAA TGTTGAATAC TCATACTCTT CCTTTTTCAA TATTATTGAA 



1360 



1420 

1480 

1540 

1600 

1660 

1720 

1780 

1840 

1900 

1960 

2020 

2080 
2140 
2200 
2260 
2320 
2380 
2440 
2500 
2560 
2620 
2680 
2740 
2800 
2860 
2920 
2980 
.3040 
3100 
3160 
3220 
3280 
3340 
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GCATTTATCA GGGTTATTGT CTCATGAGCG GATACATATT TGAATGTATT TAGAAAAATA 
AACAAATAGG GGTTCCGCGC ACATTTCCCC GAAAAGTGCC ACCTGACGTC TAAGAAACCA 
TTATTATCAT GACATTAACC TATAAAAATA GGCGTATCAC GAGGCCCTTT CGTCTCGCGC 
GTTTCGGTGA TGACGGTGAA AACCTCTGAC ACATGCAGCT CCCGGAGACG GTCACAGCTT 
GTCTGTAAGC GGATGCCGGG AGCAGACAAG CCCGTCAGGG CGCGTCAGCG GGTGTTGGCG 
GGTGTCGGGG CTGGCTTAAC TATGCGGCAT CAGAGCAGAT TGTACTGAGA GTGCACCATA 
TCGACGCTCT CCCTTATGCG ACTCCTGCAT t AGGAAGCAG CCCAGTAGTA GGTTGAGGCC 
GTTGAGCACC GCCGCCGCAA GGAATGGTGC AAGGAGATGG CGCCCAACAG TCCCCCGGCC 
ACGGGGCCTG CCACCATACC CACGCCGAAA CAAGCGCTCA TGAGCCCGAA GTGGCGAGCC 
CGATC7TCCC CATCGGTGAT GTCGGCGATA TAGGCGCCAG CAACCGCACC TGTGGCGCCG 
GTGATGCCGG CCACGATGCG TCCGGCGTAG AGGATCTGGC TAGTTATTAA TAGTAATCAA 
TTACGGGGTC ATTAGTTCAT AGCCCATATA TGGAGTTCCG CGTTACATAA CTTACGGTAA 
ATGGCCCGCC TGGCTGACCG CCCAACGACC CCCGCCCATT GACGTCAATA ATGACGTATG 
TTCCCATAGT AACGCCAATA GGGACTTTCC ATTGACGTCA ATGGGTGGAC TATTTACGGT 
AAACTGCCCA CTTGGCAGTA CATCAAGTGT ATCATATGCC AAGTACGCCC CCTATTGACG 
TCAATGACGG TAAATGGCCC GCCTGGCATT ATGCCCAGT A CATGACCTTA TGGGACTTTC 
CTACTTGGCA GTACATCTAC GTATTAGTCA TCGCTATTAC CATGGTGATG CGGTTTTGGC 
AGTACATCAA TGGGCGTGGA TAGCGGTTTG ACTCACGGGG ATTTCCAAGT CTCCACCCCA 
TTGACGTCAA TGGGAGTTTG TTTTGGCACC AAAATCAACG GGACTTTCCA AAATGTCGTA 
ACAACTCCGC CCCATTGACG CAAATGGGCG GTAGGCGTGT ACGGTGGGAG GTCTATATAA 
GCAGAGCTCT CTG6CTAACT AGAGAACCCA CTGCTTAACT GGCTTATCGA AATTAATACG 
ACTCACTATA 6GGAGACCC 

(2) INFORMATION FOR SEQ ID NO: 7: 

SEQUENCE CHARACTERISTICS : 
IA) LENGTH: 418 aoilno acids 
TYPE: aaino acid 
TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID 110:7: . 
Met Glu Pro Pro Gly Asp Trp Gly Pro Pro Pro Trp Arg Ser Thr Pro 

*| 5 

Arg Thr Asp Val Leu Arg Leu Val Leu Tyr Leu Thr Phe Leu Gly Ala 

20 25 . uw 

Pro Cys Tyr Ala Pro Ala Leu Pro Ser Cya Lys Glu Asg Glu Tyr Pro 

Val Glv Ser Glu Cys Cys Pro Thr Cys Ser Pro Gly Tyr Arg Val Lys 
50 55 60 

Glu Ala Cys Gly Glu Leu Thr Gly Thr Val Cys Glu Pro Cys Pro Pro 
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3400 

3460 

3520 

3580 

3640 

3700 

3760 

3820 

3880 

3940 

4000 

4060 

4120 

4180 

4240 

4300 

4360 

4420 

4480 

4540 

4600 

4619 
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65 



70 



75 
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Glv Thr Tyr He Ala His Leu Asn Gly Leu S r Lys Cys Leu Gin Cys 

85 90 ' -95 

Gin Met Cys Asp Pro Ala Met Gly Leu Arg Ala Thr Arg Asn Cys Ser 

100 105 110 

Ara Thr Glu Asn Ala Val Cys Gly Cys Ser Pro Gly His Phe Cys He 
" :>|t5 - 120 125 

Val Gin Asp Gly Asp His Cys Ala Gly Ala Ala Val Thr Pro Pro Pro 
130 135 140 

Ala Arg Ala Arg Gly Cys Arg Arg Glu Ala Pro Arg Val Arg Thr Pro 

Cvs Val Arg Thr Ala Pro Gly Asp Leu Leu Ser Asn Gly Thr Leu Glu 
* 165 170 175 

Glu Cvs Gin His Gin Thr Lys Cys Arg He His Lys Thr Val Ala Pro 

180 135 190 

S r Thr Cys Ser Lys Pro Thr Cys Pro Pro Pro Glu Leu Leu Gly Gly 

2 195 200 205 

Pr Ser Val Phe He Phe Pro Pro Lys Pro Lys Asp Thr Leu Met He 
210 215 220 

Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp Val Ser Gin Asp 
* 230 235 240 



Asp Pro Glu Val Gin Phe Thr Trp Tyr lie Asn Asn Glu Gin Val Arg 

245 250 255 

Thr Ala Arg Pro Pro Leu Arg Glu Gin Gin Phe Asn Ser Thr He Arg 

260 265 270 

Val Val Ser Thr Leu Pro He Thr His Gin Asp Trp Leu Arg Gly Lys 

275 - 280 285 

Glu Phe Lvs Cvs Lys Val His Asn Lys Ala Leu Pro Ala Pro- He Glu 
290 295 300 

Lys Thr He Ser Lys Ala Arg Gly Gin Pro Leu Glu Pro Lys Val Tyr 

305 . 310 315 320 

Thr Met Gly Pro Pro Arg Glu Glu Leu Ser Ser Arg Ser Val Ser Leu 

325 330 335 

Thr Cys Met He Asn Gly Phe Tyr Pro Ser Asp He Ser Val Glu Trp 

340 345 - 350 

Glu Lys Asn Gly Lys Ala Glu Asp Asn Tyr Lys Thr Thr Pro Ala Val 

•»=•=• 360 365 



Leu Asp Ser Asp Gly Ser Tyr Phe Leu Tyr Asn Lys Leu Ser Val Pro 
370- 375 380 

Thr Ser Glu Trp Gin Arg Gly Asp Val Phe Thr Cys Ser Val Met His 

385 390 395 400 

Glu Ala Leu His Asn His Tyr Thr Gin Lys Ser He Ser Arg Ser Pro 

405 410 41o 

Gly Lys 
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